Synthetic ALSPAC longitudinal datasets for the Big Data VR project
نویسندگان
چکیده
Three synthetic datasets - of observation size 15,000, 155,000 and 1,555,000 participants, respectively - were created by simulating eleven cardiac and anthropometric variables from nine collection ages of the ALSAPC birth cohort study. The synthetic datasets retain similar data properties to the ALSPAC study data they are simulated from (co-variance matrices, as well as the mean and variance values of the variables) without including the original data itself or disclosing participant information. In this instance, the three synthetic datasets have been utilised in an academia-industry collaboration to build a prototype virtual reality data analysis software, but they could have a broader use in method and software development projects where sensitive data cannot be freely shared.
منابع مشابه
Young people’s views about the purpose and composition of research ethics committees: findings from the PEARL qualitative study
BACKGROUND Avon Longitudinal Study of Parents and Children (ALSPAC) is a birth cohort study within which the Project to Enhance ALSPAC through Record Linkage (PEARL) was established to enrich the ALSPAC resource through linkage between ALSPAC participants and routine sources of health and social data. PEARL incorporated qualitative research to seek the views of young people about data linkage, ...
متن کاملSex discordance in asthma and wheeze prevalence in two longitudinal cohorts
Sex discordance in asthma prevalence has been previously reported, with higher prevalence in males before puberty, and in females after puberty; the adolescent "switch". However, cross-sectional studies have suggested a narrowing of this discordance in recent decades. We used a combination of cross-sectional and longitudinal modelling to examine sex differences in asthma, wheeze and longitudina...
متن کاملReplicating the Synthetic LBD with German Establishment Data
One major criticism against the use of synthetic data has been that the efforts necessary to generate useful synthetic data are so intense that many statistical agencies cannot afford them. However, we argue in this paper that the field is still evolving and many lessons that have been learned in the early years of synthetic data generation can now be used in the development of new synthetic da...
متن کاملYoung people's views about consenting to data linkage: findings from the PEARL qualitative study.
BACKGROUND Electronic administrative data exist in several domains which, if linked, are potentially useful for research. However, benefits from data linkage should be considered alongside risks such as the threat to privacy. Avon Longitudinal Study of Parents and Children (ALSPAC) is a birth cohort study. The Project to Enhance ALSPAC through Record Linkage (PEARL) was established to enrich th...
متن کاملObesity in adolescents with chronic fatigue syndrome: an observational study
OBJECTIVE Identify the prevalence of obesity in patients with chronic fatigue syndrome (CFS) compared with healthy adolescents, and those identified with CFS in a population cohort. DESIGN Cross-sectional analysis of multiple imputed data. SETTING Data from UK paediatric CFS/myalgic encephalomyelitis (CFS/ME) services compared with data collected at two time points in the Avon Longitudinal ...
متن کامل